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310 
URL TABLE 



ID 

PROTOCOL NAME 
IP ADDRESS 
DOMAIN NAME 
PORT NUMBER 
DIRECTORY PATH 
RESOURCE NAME 



320 

PARENT CHILD TABLE 

PARENT URL ID 
CHILD URL ID 



330 

METADATA TABLE 

URL ID 
TAG 
NAME 
VALUE 




340 

SUBSCRIBER TABLE 
ID 

NAME 

E-MAIL ADDRESS 



350 

AUTHOR TABLE 

URL ID 
NAME 

E-MAIL ADDRESS 
SUBSCRIBER ID 



360 

HEURISTIC TABLE 
URL ID 

E-MAIL ADDRESS 




FIG. 3 
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PROCESS 
500 



^ START ^ 



502 

RECEIVE METADATA FROM 
THE WEBCRAWLER 



504 

STORE THE METADATA IN 
THE DATABASE 



506 

RETRIEVE A URL FROM THE 
DATABASE 



508 

CONNECT TO THE URL 



510 

WAIT FOR A WEB SERVER 
RESPONSE CODE 




Yes 



Q END ^ 



FIG. 5A 
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FROM STEP 512, "NO" PATH 



552 

RETRIEVE EVERY 
PARENT URL 



514 

PROCESS THE 
ERRONEOUS URL 



554 

USE THE METADATA TO 
DETERMINE ACTIONS THAT 
WILL CORRECT THE URL 



556 

CREATE AN ELECTRONIC 
MAIL MESSAGE 



558 

RETRIEVE E-MAIL ADDRESS 
FOR WEB PAGE AUTHOR 




562 

APPLY HEURISTICS TO 
DETERMINE E-MAIL 
ADDRESS 



Yes 




568 
NOTIFY WEB 
PAGE AUTHOR 



566 
UPDATE THE 
SUBSCRIBER DATABASE 



No 



! 



T 



TO STEP 516 



FIG. 5B 



